Temperature Dependent Conformational Transitions and Hydrogen Bond Dynamics 
of the Elastin-Like Octapeptide GVG(VPGVG): a Molecular Dynamics Study 
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A joint experimental / theoretical investigation of the elastin-like octapeptide GVG(VPGVG) 
was carried out. In this paper a comprehensive molecular dynamics study of the temperature 
dependent folding and unfolding of the octapeptide is presented. The current study, as well as 
its experimental counterpart (see previous paper) find that this peptide undergoes an "inverse 
temperature transition", ITT, leading to a folding at about 40-50° C. In addition, an unfolding 
transition is identified at unusually high temperatures approaching the boiling point of water. Due 
to the small size of the system two broad temperature regimes are found: the "ITT regime" (at 
about 10-50°C) and the "unfolding regime" at about T > 60°C, where the peptide has a maximum 
probability of being folded at T ~ 60° C. A detailed molecular picture involving a thermodynamic 
order parameter, or reaction coordinate, for this process is presented along with a time-correlation 
function analysis of the hydrogen bond dynamics within the peptide as well as between the peptide 
and solvating water molecules. Correlation with experimental evidence and ramifications on the 
properties of elastin are discussed. 

Keywords: Elastin, molecular dynamics, inverse temperature transition, hydrogen bond dynamics. 
Abbreviations used: AFM, Atomic Force Microscopy; DSC, Differential Scanning Calorimetry; CD, Cir- 
cular Dichroism; FT-IR, Fourier Transform Infrared; NMR, Nuclear Magnetic Resonance; MD, Molecular 
Dynamics; HB, Hydrogen Bond; ITT, Inverse Temperature Transition; PCA, Principal Component Analysis; 
PPC, Pressure Perturbation Calorimetry. 



I. INTRODUCTION 



Vertebrate elastic fibers, as contained in vascular walls, 
skin or lungs, allow for reversible deformations upon me- 
chanical stress. These fibers consist of two main types 
of protein components: a fibrous component, fibrilen, 
and an amorphous component, the globular elastin pro- 
tein 0. The essential elasticity is provided by the lat- 
ter protein, elastin, which is known to have very unique 
viscoelastic properties in the water swollen state (see 
Refs. 0, I S H i for recent overviews). The in- 
soluble elastin is an extensively crosslinked polymer of 
a precursor tropoelastin. Tropoelastin is composed of 
two types of domains. One of them is rich in lysine 
and provides cross-linking of the individual monomers 
resulting in lysinonorleucin, desmosine and isodesmosinc 
links, which are responsible for its typical yellow color. 
The other is made predominantly of hydrophobic amino 
acids with the highly repetitive pentameric repeat unit 
"VPGVG" (amino acids V: valine, P: proline, G: glycine) 
which has been found to be crucial to elastin's function- 
ality Hani. 

An interesting peculiarity of both elastin and tropoe- 
lastin, is that they undergo folding by increasing the 
temperature beyond typically 25°C. The term "inverse 
temperature transition" (ITT) was coined for this appar- 
ently paradoxical change from a "disordered" (extended) 
to an "ordered" (folded) conformation upon heating. A 
striking manifestation of this phenomena is the ability to 
grow crystalline solid state structures from a solution of 
cyclic elastin-like oligopeptides by heating [ToL 11]. As 



such, tropoelastin and its synthetic analogues, have been 
a subject of intense investigation in the field of biopoly- 
mers and protein engineering. Elastin-like polymers have 
the promise of providing materials for bio- mechanical de- 
vices 0, II, IT^las well as temperature dependent molecu- 
lar switches Despite the intense research efforts, the 
detailed structure of tropoelastin still defeats elucidation. 
Hence a molecular level picture which relates the protein 
structure to the viscoelastic properties is still a matter of 
debate. What is beyond doubt, however, is the decisive 
role of water as a plasticizer - dry elastin is brittle 0. 
Still, the potentially crucial aspect of the protein's hydra- 
tion water remains largely unexplored with the notable 
exception of recent MD simulations 

The origin of the viscoelastic properties of elastin is 
controversially discussed, calling concepts such as clas- 
sical rubber elasticity |15|. various librational entropy 
mechanism ^ a El- an( i multi-phase models 0, Il4j . 
Much of the experimental data has been summarized in 
excellent reviews 0001 on both elastin and elastin-like 
polypeptides. There is much data that suggest that both 
elastin and its synthetic mimics display an interesting 
conformational dynamics in solution. Early NMR stud- 
ies suggest that elastin under physiolo gica l conditions is 
composed of highly mobile chains [Til Il9j|. which corre- 
lated well with the observation from birefringence [2(j 
measurements that elastin, in the same state, is isotrop- 
ically distributed with the chains adopting random con- 
formations. Studies of the thermo-mechanical proper- 
ties of water swollen elastin and tropoelastin are con- 
sistent with the interpretation of elastin as a classical 
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rubber with heavy emphasis on the role of entropic con- 
tributions to elasticity 15]. These findings would sup- 
port a single phase model where the structure of elastin 
is random and undergoes large amplitude fluctuations. 
Recent single-molecule AFM and spectroscopic measure- 
ments .?] studies have severely challenged this interpre- 
tation and suggest that, above the ITT temperature, a 
structurally ordered model of poly(VPGVG) can also ac- 
count for these elastic properties due to significant pro- 
tein librational contributions to the overall entropy as 
suggested two decades ago j^. Other spectroscopic ex- 
periments such as FT-IR, NMR, CD and Raman mea- 
surements [ij I22I y~J 129 on elastin-like polypep- 
tides suggest the presence of (3- and 7-turns as a common 
structural motif which may be in dynamic equilibrium 
and very short and/or distorted antiparallel /3-strands 
and disordered structures with possible dynamic sheet- 
coil-turn transitions. Recent NMR studies also point 
to the high mobility of the elastin chains in the water 
swollen elastins [27| and in solution multiple tempera- 
ture states for poly(GVGVP), each with different dy- 
namical behavior and peptide-water interactions, were 
observed [2^]. These later data would rather support a 
multiphase model of elastin Q where the fine details of 
the protein structure and in particular its interactions 
with water play a crucial role in understanding its elas- 
ticity. Overall, there is a wealth of data on the structure 
and properties of these systems, and although the mod- 
els do not agree on their interpretation, a central ingredi- 
ent is the intriguing and complex dynamical behavior of 
elastin-like polymers themselves and the crucial interplay 
with solvating water molecules. 

Atomistic simulations on elastin or elastin-analogues 
are scarce. Early work (see Ref. [3j for a review) , such as 
Ref. 0], studied the elastic properties of these proteins 
by performing short MD simulations under constraints 
in order to mimic an external pulling force. Here the au- 
thors found that entropic forces from reduced librational 
motion of the protein were indeed a main contributing 
factor in the elastic behavior. However, the authors were 
understandably limited to simulation times not exceeding 
one nanosecond. It is noted that, in general, short simu- 
lation times generally lead to unreliable statistics, due to 
insufficient sampling of configuration space, and hence 
may provide only qualitative insights. A crucial factor 
also to be considered is the explicit atomistic treatment 
of the protein/water interface and characterization of its 
role in determining elastin's properties. As a significant 
advance along these lines, recent studies on solvated 90 
amino acid polypeptides of sequence (VPGVG^g, have 
been reported, which explicitly include water 14]. The 
importance of the protein/water interface was worked out 
based on thermodynamic and static structural - as op- 
posed to dynamic - considerations. This work was suc- 
cessfully able to reproduce the ITT, clarify how water 
may contribute to the entropic contribution to the elas- 
tic forces |2{| as well as probe the role of elastin mimics 
as potential molecular machines |30j |. Overall this work 



was was successfully able to qualitatively relate its find- 
ings to much of the experimentally observed phenomena, 
though, some debate has arisen with regards to the abil- 
ity of the proposed structures to account for the tem- 
perature dependence of elastin's water content above the 
ITT ■ Despite the significant improvement these simu- 
lations have brought to our understanding of the atomic 
level details concerning elastins' behavior, these quite 
long biopolymers were simulated on the order of a few 
nanoseconds for a given temperature. In view of the ex- 
pected long relaxation times of proteins of that size [3l| 
it would be highly desirable to have access to simulation 
times that allow for quantitative statistical-mechanical 
analysis for a given number of amino acids. This is es- 
pecially true for low frequency oscillations of the protein 
backbone which play a central role in many models of 
elastin's properties. Furthermore, the dynamics and in 
particular the kinetics of hydrogen-bonds remains unex- 
plored up to now. Thus, questions regarding the coupling 
of the dynamics of the protein, the protein/water inter- 
face and hydrogen bonding remain open issues. 

A significant experimental finding was the re- 
cent demonstration [9j that oligopeptides of the kind 
GVG(VPGVG)„ display the ITT even in the limit of 
only one pentameric repeat unit. Using the limiting 
value n = 1 opens up the possibility to perform MD 
simulations that allow for fairly long simulation times 
with sufficiently many solvating water molecules. We 
therefore have launched a joint experimental / simula- 
tion study of the smallest possible elastin model, the oc- 
tamer GVG(VPGVG) (having only « 640 Da), see the 
preceding paper |25| and Ref. |24| for a preliminary short 
communication. This minimal elastin model was cho- 
sen, at the expense of experimental difficulties, in order 
to allow for extensive molecular dynamics (MD) simu- 
lations, at 12 temperatures between 280 K-390 K, with 
a thorough conformational sampling of 32 nanoseconds 
each (after equilibration), followed by in-depth statisti- 
cal mechanical analysis with a focus on dynamics. The 
current approach will allow us to obtain a crucial link 
between the multitude of experimental results and the 
atomistic simulations by directly comparing results ob- 
tained on essentially the same system under thermody- 
namically consistent conditions. Moreover, the detailed 
statistical mechanical analysis which is possible on the 
current MD simulations will provide a complementary 
perspective on simulation work already existing in the 
literature. 



II. RESULTS AND DISCUSSION 

A. Time evolution and Temperature Dependence 
of Structural Parameters 

In order to get a first impression of the dynamics of 
the solvated octapeptide GVG(VPGVG) we consider the 
time evolution of a few selected quantities describing the 
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FIG. 1: Time evolution of selected structural quantities of 
GVG(VPGVG) in water at 330 K: (a) Distance from C- to 



N-terminus, rc-N, (b) Radius of gyration, r. 
of gyration of the valine side groups, r^l 



(c) Radius 
(d) Number of 
peptide-peptide HBs, n pp , (e) Number of waters bridging two 
amino acids of peptide by HBs , rib w , (f) Number of HBs be- 
tween peptide and solvating water, n ps , (g) Projection of the 
trajectory onto the first eigenvector of the covariance matrix, 
mi; see text for definitions. Only for presentation purposes 
the functions were denoised using the Savitzky-Golay proce- 
dure [3^1 where a time window of 128 ps and a polynomial 
of 6th degree was used, whereas the analysis was carried out 
based on the original data sets. 





(b) 

FIG. 2: Graphical representation of typical protein confor- 
mations at 330 K including only bridging waters, (a) Open 
state, (b) Closed state. 



overall structure. The first parameter of interest is the 
distance from the C- to the N-terminus, rc-N, which is 
a measure of the extension of this short polymer chain. A 
typical time evolution profile of this quantity is presented 
in Figure nja) at a temperature of 330 K. Judging from 
this parameter there are two distinctly different types of 
peptide structure: the majority of the time rc-N oscil- 
lates about a distance of 17 A (corresponding to "open 
structures") while a second, less probable configuration, 
is found when rc-N ~ 5 A (stemming from "closed struc- 
tures"). Representative configurations, as obtained at 
330 K for each type of conformation, are depicted in Fig- 
ure 12 The presence of two types of peptide conforma- 
tions is also mirrored by the radius of gyration of the 
biopolymer, r gyr , which is a standard measure of a poly- 
mer's overall size. This parameter is strongly correlated 
with the end-to-end distance rc-Nj see Figure^b), such 
that for the extended structures r gyr w 8 A whereas for 
the closed ones r gyr w 4 A. As a simple measure of the 
proximity of hydrophobic side groups of the peptide we 
also consider the radius of gyration of the valine side 
groups only, r^, see Figure dc). Again, this parameter 
is found to be strongly, although not perfectly, correlated 
with the end-to-end distance and, as expected, the radius 
of gyration of the entire peptide. 

The open and closed structures have distinctly dif- 



ferent hydrogen bond (HB) arrangements as indicated 
by changes in the numbers of HBs of different classes. 
Specifically, we consider the number of HBs within the 
peptide that directly connect amino acid donor (NH) and 
acceptor (CO) groups, n pp , the number of HBs between 
the peptide and water molecules in its solvation shell, n ps , 
and the number of water molecules that bridge differ- 
ent parts of the peptide chain by simultaneously forming 
HBs to two or more non-consecutive amino acid residues, 
iibw, see Figure |2 According to Figure ^ these quanti- 
ties are again found to be strongly correlated to rc-N- ln 
the closed state peptide/peptide HBs and water bridges 
exhibit values of around n pp 2 — 3 and ribw ~ 2, re- 
spectively, whereas peptide/solvation water interactions 
amount to about n ps w 12 HBs. This is quite differ- 
ent in the extended state: there are few peptide/peptide 
HBs, n pp w 0, and bridging waters, ribw ~ 0, but about 
n ps w 16 peptide/solvation water HBs. These simple pa- 
rameters - as well as others not depicted here such as 
the moments of inertia - indicate several opening and 
closing events in this trajectory and thus suggest that 
the present MD simulations are sufficiently able to sam- 
ple the conformational space available to the peptide to 
distinguish between two very different types of structure 
which are found to be in dynamic equilibrium. 

A similar picture where one observes two types of 
structures, open and closed, with correlated fc-Nj r gyr 
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and HB types is also found in simulations where the 
thickness of the solvating water sphere is increased as 
well as with alternate peptide chain capping groups at se- 
lected temperatures of 280 and 320 K (see Appendix IA 1(1 . 
This gives us confidence that this scenario is not an aber- 
ration of our computational methodology. Moreover, in 
agreement with Ref. this latter finding suggests that 
the results should depend on only the presence of the 
repeat unit VPGVG itself. We note in passing that the 
time scale of the peptide backbone fluctuations which in- 
terconvert open and closed conformations is on the order 
of about one nanosecond (see Sect. Iff Bl and in particular 
Figure 5(b) for details) even for this minimalistic elastin 
model! The implications of this time scale is that for MD 
trajectories on the order of only a few nanoseconds sta- 
tistical quantities, which depend on the conformation of 
the peptide backbone, will be subject to large errors and 
hence should be conservatively interpreted, in particular 
if longer chains, yielding even longer relaxation times, are 
investigated. 

We now consider how this dynamic equilibrium be- 
haves as a function of temperature. Given the above 
description of the change in HB configuration as a func- 
tion of the type of structure, it is convenient to define 
the number of internal hydrogen bonds, nmB as the sum 
of ?ipp and the HB arising from the bridging waters, rtbw 
Note that both n pp and ribw exist only when the struc- 
ture is closed. An average over the entire trajectory will 
result in very small numbers with large associated errors, 
whereas, nmB is a more statistically meaningful quantity 
carrying similar information. The data from the time 
averages < r C -N >, < r gyr >, < n ffl B >, < n ps >, 
and < r^l > are plotted in Figure [31 as a function of 
temperature. As expected from the above correlations 

< ?*c-n >, < fgyr >, < «ihb >, and < r^. 1 > display a 
similar temperature dependence which may be classified 
in terms of two distinct "temperature regimes" below and 
above a temperature close to 330 K. The average end- 
to-end distance increases slightly from 280 K to 310 K, 
where a significant decrease of < 7'c-n > sets in with 
a maximum contraction at 330 K, followed by a rise in 
value up to 390 K. Similarly, < nmB > and < r gyr > 
mirror this trend. The average of the radius of gyration 
stemming from the valine groups only, < r"jj >, shows 
also an initial decrease, but at variance with the total 
radius of gyration it remains small up to high temper- 
atures subject to sizable fluctuations. Below 330 K the 
structures have slightly larger r gyr and only a few inter- 
nal HBs, which is consistent with only a small percent- 
age of closed structures. The largest number of closed 
structures is observed at 330 K resulting in the smallest 

< rgy r > and the largest < niHB >■ Above this temper- 
ature, < rgyr > slowly increases in parallel to a decrease 
in < niHB >, which is consistent with a decrease in the 
number of closed structures. Note in passing that both 

< n pp > and < nb w > also follow this same trend, al- 
though with much larger numerical noise. Overall, our 
picture not only is consistent with the interpretation of 
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(d) 




(e) 

FIG. 3: Average quantities as a function of temperature: (a) 
Distance from C- to N-terminus, rc-N (circles), (b) Radius 
of gyration, r gyr (squares), (c) Radius of gyration of valine 
side groups, r^ 1 (triangles down), (d) Number of HBs be- 
tween peptide and solvating water, n ps (triangles up), (e) 
Number internal HBs, hihb (diamonds); see text for defini- 
tions. Dashed lines are linear connections of the data to guide 
the eye. 



an ITT occurring at around 330 K but we also find a slow 
trend reversal at higher temperatures. 

As a measure of how the peptide/ water interface be- 
haves during these structural transitions the total num- 
ber ofinterfacial HBs, < n ps >, is considered, see Fig- 
There is a continuous decrease in < n 



3(d) 



> 



observed upon increasing the temperature with a pro- 
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nounced kink at around 330 K. This hints that the 
peptide/water interface also exhibits two temperature 
regimes as already observed for the structural quantities 
associated with the peptide structure. It is noted, how- 
ever, that this trend obtained from averaging the struc- 
tural quantity n ps is much less pronounced and is exam- 
ined in greater detail in Sect. Ill Cl bv dynamical analysis 
of the peptide/water interfacial HBs. 



Overall, these averaged structural quantities provide a 
suggestive, though by no means comprehensive, picture 
of the atomic level details of the peptide structural tran- 
sitions. The presence of two types of structures, open and 
closed, correlates extremely well with the observation of 
an isodichroitic point in the CD spectra [H, |2f| and sup- 
ports the description of this small peptide as behaving 
like a two-state system. The increase of the contribution 
of closed structures, at temperatures up to 330 K and 
its subsequent decrease at higher temperatures is con- 
sistent with the temperature dependence of more com- 
pact structures containing 7 and (3 turns as observed in 
the FT-IR spectra of the complementary experimental 
investigation [2j|. The DSC and PPC thermodynamic 
measurements presented in the joint experimental study 
may also be interpreted as the peptide ultimately re- 
verting to a structurally and thermodynamically simi- 
lar state at temperatures near the boiling point of wa- 
ter. In addition, the above scenario is qualitatively in 
accord with that presented in Ref. which describes 
the ITT using a similar but much longer model peptide. 
Specifically this study also finds an increasing number 
of peptide/peptide HBs, an overall average contraction 
of the peptide at the ITT, a decreasing number of in- 
terfacial water molecules and an increasing number of 
bridging waters which may be thought of as the internal 
waters in our small elastin-like analogue. The authors 
of Ref. attribute this shrinkage to a "hydrophobic 
collapse" which is in agreement with our observed tem- 
perature dependence of < n ps > and < r^] >. However, 
the current picture suggests the subtle but crucial differ- 
ence that there is a net exchange of peptide/peptide and 
bridging water intramolecular HBs for peptide/water in- 
terfacial HBs; the energetic implications of the various 
HB types are addressed in Sect III CI At this point we 
stress that our finding and its interpretation has a strik- 
ing resemblance to the recently advanced description of 
self-association of methanol in water whereby the 
arrangement of hydrophilic-hydrophilic interactions are 
assumed to determine the structure as opposed to en- 
tropic hydrophobic-hydrophilic interactions. In the fol- 
lowing two subsections these suggestive findings, which 
are based on simple descriptors, will be further scruti- 
nized by a detailed statistical-mechanical analysis. 



B. Principal Component Analysis and Order 
Parameter 



To probe the above findings in a systematic but simple 
way one would ideally wish to describe these phenomena 
by a single parameter, i.e. by an order parameter or re- 
action coordinate. To explore this possibility we have 
employed what is called covariance, essential dynam- 
ics, or principal component analysis (PCA) 0, |3f| |3|| 
(see Appendix IA 31 for a short introduction). In this 
approach one computes, by time averaging, < ... >, 
over the entire MD trajectory the covariance matrix 
C =< [x(i)- < x >] • [x(t)- < x >] T >, where x(t) 
are the Cartesian coordinates of the peptide atoms at 
time t in a frame of reference where the overall transla- 
tions and rotations of the polymer have been subtracted. 
The 3./V — 6 normalized eigenvectors {m^} with nonzero 
eigenvalues {Xi} provide a basis in which the complex 
polypeptide motion may be decomposed; more techni- 
cal details are compiled in Appendix IA 31 This then 
provides a theoretical framework which can be though 
of being analogous to harmonic normal modes that are 
of ubiquitous use to explain spectroscopic observations 
of small molecules. In practice this allows us to disen- 
tangle relevant peptide backbone folding motions from 
small-amplitude vibrations via projecting the deviations 
of the MD trajectories from the average structure onto 
the eigenvectors, i.e. rhi — [x(i) — < x >]-nij. Ideally, for 
uncorrelated small-amplitude vibrations the correspond- 
ing projections rhi will simply fluctuate about an aver- 
age value yielding an approximately Gaussian distribu- 
tion function, P(rhi); the width might be temperature- 
dependent akin to quasi-harmonic molecular vibrations. 
Strong deviations from such a unimodal behavior would 
single out "interesting modes" that can be associated to 
collective changes involving many atoms. Upon changing 
the temperature the various modes are expected to mix, 
in particular the higher-order modes. Thus, a common 
basis {rrii} has to be chosen to ensure a consistent analy- 
sis. In the following, projections {rhi} will be performed 
onto the basis of eigenmodes of the low-temperature tra- 
jectory at 280 K. 

Upon projecting the MD trajectories onto the eigen- 
vector with the largest eigenvalue Ai it is found that this 
mode, mi, actually describes the opening and closing 
motion of the octapeptide, and is present at all temper- 
atures, see Figure |4(a)| for a graphical illustration. In 
Figure njg), we depict this projection fh\ as a function 
of time at 330 K where it can be inferred to be perfectly 
correlated with the other parameters already discussed 
in the previous section. The octapeptide thus exists in 
a closed state rhi ~ —22 A with end-to-end distances 
of about 5 A and an extended state rhi ~ +5 A and 
end-to-end distances of about 17 A. In addition, the 
projection process yields for rhi a bimodal distribution 
function P(mi) at low temperatures, whereas it adopts 
a unimodal, but very broad and skewed shape upon heat- 
ing, see Figure 4(c) 
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FIG. 4: Results of the Principal Component Analysis: (a) 
Visualization of large amplitude opening and closing mode, 
mi (b) Visualization of large amplitude librational mode, rri2 
(c) Distribution function of the projection rh\ at 280, 330, 360 
and 390 K. (d) Distribution function of the projection rfi2 at 
280, 330, 360 and 390 K. (e) Relative free energy along the 
projection fh\. (f) Relative free energy along the projection 
rfi2- In (a) and (b) an artifical trajectory using the full first 
mi, (a), and second rri2, (b), eigenvector was synthesized for 
a suitable graphical presentation. In (e)-(f) 280 K(squares), 
330 K (triangles) 380 K (diamonds) and 390 K (circles). For 
presentation purposes the free energy profiles for 330, 380 
and 390 K were shifted in energy by 5, 10 and 15 kJ-mol , 
respectively. 



The mode with the next largest eigenvalue, m2, can 
be classifie d as a peptide backbone librational mode, 
see Figure 4(b) for a sketch. During this motion the 
peptide backbone undergoes twisting which is actually 
very similar in spirit to that described for the defor- 
mation of /3-turns in Refs. 0, However, in con- 
trast to the lowest-order projection mi, rfi2 as well as 
all other higher-order mode projections of the peptide 
backbone are characterized by fairly symmetric, narrow 
and unimodal distributions P(rhi) that are essentially 
temperature-independent, see Figure 4(d) This implies 
that - as far as the interpretation of the behavior of the 
folding transitions are concerned - we are able to neglect 
all backbone librational motions and reduce our consid- 
erations down to only a single degree of freedom: mi. 
Thus, we may confidently employ the corresponding pro- 
jection 77i i as a "natural" many-body collective reaction 
coordinate, or order parameter, for investigating the fold- 
ing transition dynamics of GVG(VPGVG) in water. 

Having obtained a proper order parameter distribution 



function, P(mi), allows one to readily define an effective 
relative free energy profile (or "potential of mean force" ) 
according to AF(rai) = — ksT \n[P(rhi) / P(rh\ ct )] as a 
function of temperature; fee is the Boltzmann constant. 
The normalizing reference value P(rh\) is chosen such 
that the arbitrary value at the minimum of the free en- 
ergy profile is set to zero for convenience, see Figure 14(e)] 
At 280 K, the free energy shows two minima, separated 
by a small barrier, related to both open (broad global 
minimum at fh\ « 5 A) and closed (local minimum at 
rhi w —20 A) structures with the latter about 3 kJ/mol 
higher in energy then the former. By 330 K the mini- 
mum associated with the closed structure, as well as, the 
barrier become much lower in relative free energy thus 
the entire profile of AF broadens out into a flat poten- 
tial energy landscape. Above 330 K the free energy of 
the closed minimum gradually increases again and even- 
tually disappears by 370-390 K. Performing an identical 
analysis for the remaining modes provides approximately 
harmonic potentials of mean force, see Figure 4(f) for the 
potential derived from m 2 , in agreement with our use of 
only ih\ as an order parameter. The trend in AF(rhi) 
reflects the presence of the temperature regimes observed 
from the average parameters, i.e. increased folding from 
280-320 K, the observation of maximum folded struc- 
tures around 330 K, and the unfolding at T > 340 K. 
The presence of two minima in AF(fhi) in conjunction 
with the temperature-induced changes naturally explains 
the existence of the isosbestic and isodichroitic points ob- 
served in FT-IR and CD experiments [25|, respectively, 
and gives us confidence in the reliability of our simula- 
tions. 

At 330 K, fluctuations and thus statistical errors are 
larger then at other temperatures due to the broader 
configuration space which must be sampled possibly in 
conjunction with slower relaxation. This effect is similar 
to the "slowing-down problem" occurring for statistical 
convergence at second- order phase transitions where all 
quantities, and in particular the order parameter itself, 
undergo large fluctuations. However, the finite system 
size does not allow for a true phase transition with a 
well defined transition temperature but rather one ob- 
serves, in agreement with spectroscopic and thermody- 
namic measurements 0, l24l |25| the two temperature 
regimes which are bracketed about the temperature of 
maximal folding, T « 330 K. 

The dynamical motion of the peptide itself can be ana- 
lyzed by examining the time-dependence of the reaction 
coordinate fh\ as described by the correlation function 
c m (i) = (mi(O)mi (i)) / (mi(O)mi(O)), see Figure 5(a 



This time auto-correlation function may be well approx- 
imated in the short time limit, t < 100 ps, by an expo- 
nential function, c m (i) ~ exp[— i/r m ], with an associated 
relaxation time r m of the peptide's opening and closing 
motion. The temperature dependence of this important 
dynamical parameter is depicted in Figure [5(b) For tem- 
peratures below 330 K, r m w 300 — 350 ps is roughly con- 
stant within the accuracy of our statistics. Above 330 K, 
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(b) 




(c) 



FIG. 5: Dynamics of the mi eigenmode: (a) Auto-correlation 
function c m (t) at 280 K (squares), 330 K (triangles), 360 K 
(diamonds) and 390 K (circles), (b) Relaxation time r m as 
a function of temperature; here the symbol size reflects ap- 
proximate error bars, (c) Peptide backbone quasi-harmonic 
entropy, S (down triangles), as a function of temperature; see 
text for definitions. 



there is a progressive speed-up (r m ~ 130 — 150ps) which 
reflects the steepening of the free energy surface in the 
region of closed structures, i.e. for rh\ <C 0. From 370- 
390 K, where the free energy surface contains essentially 
a single minimum, rh\ +5, with approximately con- 
stant curvature, the relaxation time levels off to a plateau 
value of about r m 150 ps. Temperature variations of r m 
thus reflect three dynamical scenarios corresponding to: 
folding, unfolding and unfolded, which is slightly differ- 
ent from the observations made on the average structural 
quantities. Note, however, that the last of the three tem- 
perature regimes represent a limiting case of the unfold- 
ing transition where there effectively no longer exists a 
mimimum for closed structures. These findings correlate 
well with larger peptide backbone fluctuations observed 
in NMR spectroscopy [27| for water swollen elastins. Un- 
fortunately, the most directly comparable NMR measure- 
ments of 13 C relaxation times for poly(GVPGV) in so- 
lution are only obtained for the temperatures below the 
ITT [2^] . None the less, the authors report a mean ef- 
fective correlation time for polymer motion on the order 
of about 500 ps at 298 K in astonishing agreement with 
the 300 ps time scale reported here at the same temper- 



ature. These facts suggest that temperature-dependent 
measurements of NMR 13 C relaxation times for this small 
system may serve as an independent verification of our 
findings. 

Previous MD studies on much larger poly(GVPGV) 
chains find qualitatively that the peptide becomes 
"slightl y m ore dynamic" or less rigid at higher temper- 
atures [1J|. This also matches our quantified speed- 
up of the dynamics of the peptide's backbone opening 
and closing motion. Thus despite the increase in num- 
ber of peptide/peptide HBs, there must be an overall 
increase of freedom in the motion of the peptide back- 
bone. The question then arises: does this extra motion 
lead to favorable entropic contribution to the ITT from 
the peptide? To address this, we consider the entropy 
change of the peptide backbone motion as estimated 
from a quasi-harmonic approximation based on the prin- 
cipal component analysis [33, H3- Note, this analysis 
is based on a quasi- harmonic approximation [37l l38| . 
whereas the lowest-order mode, mi, which contributes 
about 40% to the overall variance of the peptide back- 
bone, is strongly anharmonic and even bimodal at low 
temperatures. However, its contribution to the total pep- 
tide backbone entropy is fairly small with much larger 
contributions arising from the more harmonic higher- 
order modes. Figure 5(c) shows the entropy of the pep- 
tide in a united-atom representation (see Appendix IA 3|) 
as a function of temperature. We find that despite the 
volume contraction of the peptide below 330 K the en- 
tropy is increasing with rising temperature, i. e. the en- 
tropy of the peptide itself increases upon folding. The 
associated entropy change is also consistent in magni- 
tude with the experimentally measured entropy increase 
of 0.15 kJmor 1 K" 1 and 0.11 kJmor 1 FT 1 (at 298 K 
and 1 bar) obtained from a van't FToff analysis of equilib- 
rium constants of both CD and FT-IR data [1 I M l25 | . 
These findings arc similar in spirit to various librational 
entropy models employed to explain viscoelastic proper- 
ties of elastin and its synthetic mimetica 0, 0, |g| . Our 
analysis does not, however, eliminate the possibility that 
stabilizing entropic terms from the water are also im- 
portant contributors to the folding behavior. Neverthe- 
less, our simulations clearly demonstrate that our elastin 
mimic is a very dynamical entity and that the thermo- 
dynamic consequences of this motion, in particular the 
increasing entropy, are a key component in understand- 
ing its structural transitions. 



C. Hydrogen Bond Dynamics and Kinetics 

In order to address the role of water and in particu- 
lar the influence of hydrogen bonding on the peptide's 
conformational transitions we analyze the dynamics of 
the HB network. To this end, we emplo y th e intermit- 
tent HB auto-correlation function pi itol lil lil lil Eif . 
c(i) = (h(Q)h(t)} I (h). As usual the HB population vari- 
able h(t) is defined to be unity, h(t) — 1, if a particular 
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HB exists at time t and zero h(t) = otherwise; the 
criteria to define a HB are compiled and discussed in 
Appendix I A 41 and the average (. . .) is taken over all HBs 
that were present at time t — 0. This function describes 
the probability that a HB, which was intact at t = 0, is 
intact at time t independently of possible breakings (and 
reformations) in the interim time. Similar to the average 
number of HBs in Sect. Ill A"l the following classes were in- 
troduce d: HB s between water molecules in the bulk c w (t) 
(Figure |6(a)| ) , between first s hell p rotein solvation water 
and bulk water c sw (t) (Figure 6(b) I, between the pe ptide 
and first shell solvation water c ps (t) (Figure |6(c)|), and 



direct HB contacts of the peptide with itself c pp (f) (Fig- 



ure 6(d) I. These functions provides us with a wealth of 
insights into the complex dynamics and the associated 
time scales of the solvated peptide in relation to e.g. re- 
laxation dynamics in bulk water. Comparison of Cm(t) 
and c ps (i) nicely shows the well known trend |45lE6l E7| 
of slower dynamics for interfacial peptide water HBs then 
those within bulk water: depending on the temperature 
c w (i) and Cp S (i) decay to zero between 10-20 ps and 20- 
50 ps, respectively, at temperatures below 330 K. At the 
highest temperatures both functions decay to zero within 
10 ps with the most pronounced speed-up of this process 
occurring at the peptide/water interface. Conversely the 
function c sw (t) decays to zero in only a few picoseconds 
at all temperatures indicating that only fast HB break- 
ing dynamics occurs for this class of bonds. A curiosity 
is the function c pp (i), which instead of decaying to zero 
still retains a finite value for upwards of 100 ps or more. 
This behavior arises from the fact that peptide/peptide 
HBs, once broken cannot diffuse away from each other 
as they are formed involving the peptide backbone, i.e. 
they have a finite probability to reform even after long 
elapsed times due to topology. 

To extract further information on the processes occur- 
ring on these various time scales we make use of a more 
elaborate correlation function [iol El El El El Esj. 
n(t) = {h(0)[l - h(t)]H(i)) / (h), This function n{t) mea- 
sures the conditional probability that a HB, which was 
intact at t = 0, is broken at time t > given that the 
donor / acceptor pair that established the HB at t = 
is still close enough at time t > to potentially form 
again the HB. The function H(t) is unity only if the HB 
pair distance is smaller than a cutoff distance of 3.6 A, 
which is the second nearest neighbor shell of the donor / 
acceptor pair radial distribution function. Thus, n(t) al- 
lows one to focus on the dynamics for t > 2 ps where 
the HB dynamics is more complex due to its coupling to 
translational diffusive motion [ifl Ell E8| . whereas the 
time domain for t < 2 ps is dominated by fast librational 
and possibly vibrational motion of water molecules. This 
function plays a significant role for two types of HBs con- 
sidered in this work: HBs between water molecules in 



the bulk, n w (t) (Figure 7(a) I, and HBs between the pep- 
tide and water molecules in its first solvation shell, n ps (t) 
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FIG. 6: Temperature and time dependence of various HB 
auto-correlation functions c(t): HBs between (a) water 
molecules in the bulk c w , (b) first shell solvation water and 
bulk water c sw , (c) the peptide and first shell solvation water 
c ps , and (d) direct HB contacts of the peptide with itself c pp ; 
see text for definitions. The choice of contour lines is coded 
in each panel as (a, b, c) where a and c denote the lowest and 
the highest contours, respectively, and b defines the relative 
spacing. 



(Figure 7(b) I. In both cases n(t) increases from t = in 
view of the formation of non-hydrogen bonded pairs re- 



9 




5 10 15 
t/ps (0,0.05,1.0) 



TABLE I: Parameters obtained from Arrhenius analysis of 
rate constants obtained for HB dynamics according to Eq. Q. 
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FIG. 7: Temperature and time dependence of various HB au- 
tocorrelation functions n(t): HBs between (a) water molecules 
in the bulk n w and (b) the peptide and first shell solvation 
water n ps ; see text for definitions. The choice of contour lines 
is coded in each panel as (a, b, c) where a and c denote the 
lowest and the highest contours, respectively, and b defines 
the relative spacing. 



suiting from initial HB breaking, up to a maximum point 
and then decreases steadily to zero as the probability of 
reforming the HB decreases due to diffusion. Comparison 
of Figure 7(b) 'a) and (b) shows a more rapid decay for 



n w (t) than n ps (t), which is due to the slower dynamics 
of HBs at the peptide/water interface. 

Following previous studies 0, fill |4S| , we formulate a 
rate equation describing the kinetics of HBs as a com- 
bination of terms resulting from breaking and reforming 
HBs 



dc(t) 
dt 



kc{t) - kn{t) 



(1) 



where k is the rate constant of HB breaking and k is that 
for HB reformation subsequent to breaking; note that 
t — \ I k defines an average HB lifetime. For bulk wa- 
ter, see Figure 8(a) and Tabled both time constants k w 
and k w show a simple Arrhenius or classical transition- 
state-theory behavior, i.e. k — Aexp [-Ei/k B T], at all 
temperatures (with E^ « 7 kJ/mol and E^ » 5 kJ/mol). 
Thus, both HB breaking and reformation increase in rate 
constantly with temperature in the bulk, as expected for a 
simple thermal process. Note that the bulk water value of 
k w at 300 K is close to data from simulations of pure wa- 
ter at room temperature |4pL l4lL 143 . |4^ | whereas our value 
of k w is smaller by a factor of about one half. Although 



some of this discrepancy may arise from systematic er- 
rors as discussed in Appendix IA 41 it is more likely due 
to the difference in the dynamical properties of TIP3P 
water from SPC or SPC/E models (see Ref. 1491 and ref- 
erences therein) used in previous studies |40l l4lL lil . |4S| . 
In particular, TIP3P water has a larger diffusion coef- 
ficient |49j which allows the water molecules to diffuse 
away at a different rate after an HB breaking event. 

Although the bulk water behaves in an expected man- 
ner the same cannot be said for HBs that connect the 
peptide to its surrounding water shell. As with the aver- 
age structural parameters and the peptide backbone re- 
laxation time, both fc ps and k ps show two distinct Arrhe- 
nius temperature regimes bracketed around 330 K. Below 
320 K the activation energy E$ s ps 12 kJ/mol to break an 
interfacial HB is higher than that in the bulk, whereas 
above 330 K it is much lower (w 4 kJ/mol). Concur- 
rently, HB reforming at the interface: E^ s «11 kJ/mol 
is close to that for breaking, but at 330 K the apparent 
activation energy (E^ s « kJ/mol) vanishes. In terms of 
rates, see Figure |H1 both processes to break and reform 
HBs at the peptide/water interface effectively slow down 
upon heating (w.r.t. a linear Arrhenius-extrapolation of 
the low temperature behavior) once a critical threshold, 
about 320-330 K, is surmounted. Hence, the rate to make 
interfacial HBs essentially reaches a plateau value above 
330 K, whereas the one to break these bonds continues 
to grow with temperature, see Figure [S] 

This observation of a profound change in the dynam- 
ics of the peptide/water interface is in qualitative ac- 
cord with thermodynamic measurements 0, l24l |2f| . In 
particular, the behavior of the thermal expansion coeffi- 
cient also indicates an overall decrease in the "strength" 
of these interactions. This is also consistent with the 
proposed hydrophobic collapse which occurs for the ITT 
of elastins [lj]. However, the most intriguing aspect of 
this observation is that the peptide/water HB dynamics, 
which occurs on a time scale of 1-10 ps, mirrors the dy- 
namics of the peptide which occurs on time scales that 
are two orders of magnitude larger. 

We now investigate the dynamics of the HBs between 
interfacial water and bulk water as well as those involv- 
ing the bridging waters. For the latter class a correlation 
function Cb w (^) is introduced based on a HB population 
function hi, w (t) which is unity only if a water molecule 
is simultaneously bound to two or more different residues 
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FIG. 8: Arrhenius_plot of rate constants for HB breaking, k, 
and reformation, k, for HBs between (a) first shell protein 
solvation water and bulk water fc aw (down triangles), water 
molecules in the bulk fc w (open up-triangles) and fc w (filled 
up-triangles); (b) the peptide and first shell solvation water 
fcps (open diamonds) and k ps (filled diamonds) ; (c) the peptide 
with itself k pp (circles). The symbol size covers the error 
bars and the lines are linear (i.e. Arrhenius) fits; see text for 
definitions and Table for the resulting fit parameters. 



of the peptide. For the former of these two HB classes 
the second term in Eq. may be neglected due to the 
relatively rapid decay of c sw (t) to zero. We find that fc sw 
features essentially an Arrhenius behavior at all temper- 
atures with a similar value of E* w — 7 kJ/mol as bulk 
water, see Figure |8(a)| and Table [I] This picture is en- 
tirely in accord with the proposition that these surface 
water molecules have a hindered orientation which strains 
the HBs that are formed with bulk water. This does not 
allow for HB reformation after breaking, hence the pro- 
cess of fast HB reformation does not occur for this class 
of HBs. More surprising is that the correlation function 
of the bridging waters, Cb w (t), essentially decays to zero 
within 2 ps at all temperatures (not shown for that rea- 
son). For this small peptide it may thus be concluded 
from the dynamics that, although water-mediated pep- 
tide/peptide bridges ("internal water") do occur, they 
play no key stabilizing role for the folded state. As a 
corollary for the above assertion of interchange of HBs 
to be at the heart of the folding / unfolding behavior of 
elastin, the stability of the folded state must then come 
from the peptide/peptide HBs. 

We thus turn our focus to the function c pp which de- 



scribes the dynamics of the peptide/peptide HBs. Unfor- 
tunately, the poor statistics resulting from the few HBs 
that exist for T < 320 K, and the relatively poor conver- 
gence for long-time decay only allow for a partial analy- 
sis. Nonetheless, we are able to make several important 
observations. First, the short-time regime t < 4 ps can 
be used to obtain an approximation to k pp in a similar 
fashion as above, see Figure |H1 Below 330 K, k pp may be 
interpreted as having approximately Arrhenius behavior 
with « 8 kJ/mol, see Figure Efc). This barrier is 
similar to that for HBs between bulk water but less then 
those between the peptide and solvent waters; see Ta- 
ble H] for a detailed comparison. Above 330 K, where 
a larger number of peptide/peptide HBs improves the 
statistics of c pp (t), fc pp is again found to have an approx- 
imate Arrhenius behavior. At these temperatures the 
peptide/peptide HBs exhibit a slightly lower activation 
barrier for breaking of £| p « 6 kJ/mol, which is larger 
than that for peptide/water HBs in the same tempera- 
ture regime, and will thus stabilize the folded state. The 
increase of thermal energy which allows peptide/water 
HBs to break more easily decreases the stability of the 
open state but has less effect on the folded state which 
is in part stabilized by peptide/peptide HBs. Second, al- 
though the long time decay of the function has too much 
numerical noise to provide a reliable fitting it is evident 
that the probability to recover the same HB pair de- 
creases rapidly with increasing temperature above 330 K, 
see Figure |6(11)1 This is in accord with the above obser- 
vation of a speed-up of the peptide's backbone motion 
in this same temperature regime. Increasing fluctuations 
of the peptide backbone above 330 K result in break- 
ing peptide/peptide bonds more rapidly and decreasing 
the probability with which they reform. This effect will 
counter the stabilization from peptide/peptide HBs for 
the folded state and will eventually lead to the unfolding 
at higher temperature. 



III. CONCLUSIONS 

The presented study is successfully able to reproduce 
the inverse temperature transition of a minimal elastin 
model at about 40-50°C. Additionally, an unfolding tran- 
sition is identified at temperatures approaching the boil- 
ing point of water. Our simulations reproduce the key 
observations obtained from FT-IR, CD, DSC, and PPC 
experiments in the companion study 25] of the same wa- 
ter solvated octapeptide. Due to the small size of the 
system two broad temperature regimes are found both 
in experiment and in the simulations: the "ITT regime" 
(at about 10-50°C) and the "unfolding regime" at about 
T > 60° C where the structure has a maximum proba- 
bility of being folded at T « 50° C. A detailed molecular 
picture involving a reaction coordinate and a free energy 
profile for this process is presented along with a thorough 
time-correlation function analysis of the hydrogen bond 
dynamics and kinetics within the peptide as well as at the 



11 



peptide/water interface. In particular, the two regimes 
are characterized by changes in a dynamical two-state 
equilibrium between open and closed conformations and 
a change of the HB dynamics involving interfacial water 
molecules. 

In a nutshell, our data suggest a simple qualitative pic- 
ture of the observed transitions. At low temperatures, 
a relatively strong peptide/water interaction stabilizes 
open conformations relative to closed ones. The increase 
of thermal energy however decreases the stability of the 
extended state but has less effect on the folded conforma- 
tion which is in part stabilized by peptide/peptide HBs. 
Thus, there is a shift in equilibrium and one observes an 
increase in folded structures and hence the ITT. A second 
important contributing factor is the increase of peptide 
backbone fluctuations above the ITT and the resulting 
entropic stability. However, the large librational ampli- 
tude motion of the peptide backbone can provide suffi- 
cient thermal excitation to break these peptide/peptide 
HBs at higher temperatures which ultimately leads to a 
second, unfolding, transition. 

The current work, based upon a temperature scan us- 
ing 32 ns simulations of an octapeptide, gives further 
support to the conclusions of previous studies employing 
much shorter MD simulations on longer poly(VPGVG) 
peptides [3 |H |H H3 ■ As in these studies we also ob- 
serve a decrease in the peptide/water interactions around 
the ITT and a speed-up of the peptide backbone dynam- 
ics above this transition. Although our longer simulation 
times do qualitatively agree with the findings of this ear- 
lier work the present study is able to provide a more quan- 
titative analysis of the dynamical processes associated 
with the peptide structural transitions. Novel findings re- 
lated to the inter-relation of of peptide/water HB dynam- 
ics and peptide librational entropy, which are comple- 
mentary aspects of the ITT and unfolding transitions as 
opposed to being alternate models, are only possible due 
to these quite long MD trajectories. Thus, the current 
work emphasizes the importance of detailed statistical- 
mechanical analysis in constructing atomic level pictures 
of complex biomolecular processes. 

The limitation of the current approach however, is that 
it is not at present clear how all of the factors, which have 
been isolated as key ingredients in the peptide structural 
transitions, will apply to large polypeptides. Although it 
is highly likely that decrease in peptide/water HB inter- 
actions and peptide librational entropy will remain im- 
portant ingredients the relative weights of the two factors 
in larger polymers is unknown. Most notablythe ener- 
getic/entropic contribution of bridging waters in larger 
polymers where the folded peptide may be truly said to 
have an "inside" is currently an open issue. To address 
these questions, future combined experimental and theo- 
retical work on GVG(VPGVG)„ with n = 2 and 3 includ- 
ing mutant structures as well as tropoelastins is currently 
underway. 
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APPENDIX A: THEORETICAL METHODS 
1. System 

The system consists of an eight amino acid oligopep- 
tide of sequence GVG(VPGVG) capped by methylamine 
(-NH-CH 3 ) and acetyl (-CO-CH3) groups at its C- and 
N-termini, respectively. In accordance with the exper- 
imental finding that VPGVG is the minimal repeat 
unit necessary for observing the ITT irrespective of the 
end groups, the caps are not expected to change the ba- 
sic scenario as found in our complementary experimental 
study on the zwitter ionic species 0> HH j further checks 
arc discussed below. The octapeptide is solvated with 
2127 waters and its center of mass is fixed to the cen- 
ter of a 50 A diameter spherical droplet surrounded by 
stochastic boundary conditions [Hcf. This sphere diame- 
ter allows us to maintain at least three to four molecules 
between the peptide and the stochastic boundary even for 
the configurations where elastin is at its maximum elon- 
gation of about 27 A. We employ the all atom Charmm 
force field 51] for the peptide and the TIP3P -52j water 
potential. 



2. Simulations 

Within the EGO molecular dynamics package [U HH 
a weak Berendsen thermostatting scheme (coupling time 
constant of 100 fs) was used to control the temperature 
using a 1 fs MD time step and the Shake algorithm 
for fixing the bonds between hydrogen and heavy atoms. 
This conservative choice of parameters for a careful in- 
tegration of the equations of motion has allowed us to 
perform stable simulations which included equilibration 
times of 2-5 ns (depending on temperature) prior to ob- 
taining 32 ns trajectories at 12 temperatures between 280 
and 390 K. An additional run was performed at 400 K 
(not shown) in order to stretch the temperature range 
as much as possible beyond the boiling point of water. 
For many properties, such as HB breaking and refor- 
mation rate constants, the same trend as obtained from 
"extrapolation" of the data from 340 to 390 K was ob- 
served. However, in particular average structural param- 
eters showed deviations from this trend which we inter- 
pret tentatively as failures of the simulation approach. 
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Note that 400 K exceeds by about 100 K the tempera- 
ture range for which the employed force fields for both 
water and the peptide are optimized, i.e. ambient con- 
ditions. As a convergence check an additional 8 ns run 
was obtained at 280 K in order to test the reliability of 
both average and dynamical properties of the protein; 
the results at 280 K are quoted for the 32 ns MD run 
for statistical consistency. Internal pressures inside the 
droplet were found to be less than about 0.1-0.2 kbar 
depending on the temperature, which is well below the 
pressure of the order of many kbars that is required to 
suppress protein folding entirely [2^, |55| . 

In order to cross-check the possible influence of the 
capping groups we have computed two 25 ns trajectories 
at 280 and 320 K in larger droplets (60 A diameter with 
3700 water molecules) for both the capped and zwitteri- 
onic peptides. We obtained qualitatively identical results 
to those presented here indicating that the ITT is not sig- 
nificantly affected by the presence of the caps, which is in 
line with experimental findings [3, nor by the spherical 
boundary conditions for the chosen size of the solvating 
droplet. 

3. Principle component analysis 

Principal component analysis (PCA) is a standard pro- 
jection method in statistical data analysis, feature extrac- 
tion, and data compression [3(| . Given a set of multivari- 
ate data, the purpose is to find a smaller set of variables 
with less redundancy reproducing the original data as 
well as possible. The redundancy is measured by corre- 
lations between the data elements. 

The starting point in PCA is a random vector x with 
n elements and a sample x' 1 ', . . ., x^ L ' from this vector. 
No assumptions on a generative model or the probability 
density of the vectors are made as long as the first- and 
second-order statistics can be estimated from the sample. 

First of all the vector x is centered 

x <— x- < x > (Al) 

and the mean < x > is estimated from the available 
sample. Geometrically this transformation corresponds 
to a translation of the entire sample so that the mean is 
now at the origin. Next consider a linear combination 

n 

rhi = ^ wi k x k = Wj x , (A2) 

k=l 

where the index goes over all elements of x. The 
weights Wii,. . . ,Wi k are real numbers, elements of an n- 
dimensional vector wi and w^f denotes the transpose of 
wi. The factor rhi is called the first principal component 
of x, if the variance of fh\ is maximally large. Therefore 
one has to maximize the second moment 

< m\ >=< (wfx) 2 >= wj < xx T >wi = wf C x wi 

(A3) 



subject to the normalization condition 

|w 1 | = (wfw 1 ) 1 /2 = i , (A4) 

where | • • ■ | denotes the euclidian norm 

|wi| = (wfwO 1 / 2 (A5) 

of wi and C x is the n x n covariance matrix of x. The 
maximization problem can be rewritten 

(C x - AI)wi = (A6) 

or 

det(C x - AI) = (A7) 

which means that A is an eigenvalue of C x and that Wi 
is the corresponding eigenvector 

wi = nil ■ (A8) 

Therefore the first principal component is a projection 
of the vector x onto the unit length eigenvector of it's 
covariance matrix having the largest variance, i.e. eigen- 
value. 

This solution can be generalized to n principal compo- 
nents where the i-th principal component is 

71 

rhi = ^ m lk x k = mf x (A9) 
fe=i 

with 1 < i < n and the eigenvectors are ordered in 
descending order of the magnitude of their eigenvalues 
Ai > A2 > ■ • • > A n . Thus the original data set can be 
fully reproduced by the expansion 

n n 

^2 rriiiiii = 2J mf x m { . (A10) 

i=l i=l 

The hope is now that by considering just the first few 
principal components most of the data can be represented 
as 

n j<n 
X ^ ^ TfljTU-i ~ ^ y — ^approx i.Q. X a pp rQ x ~ X . 

i=l i=l 

(All) 

In the case of analyzing MD trajectories the variable x 
can be an 3iV-dimensional vector consisting of the carte- 
sian coordinates of the TV atoms under consideration in 
the form 

x= (xi,yi, zi, x 2 , V2,Z2, ■■■ ,x N ,y N ,z N ) (A12) 

and the sample x^, . . ., x'^ is given by a MD trajectory 
of finite length L, see e.g. Refs. y4],|35|]. Before perform- 
ing the PCA itself the data set must be corrected for over- 
all translation and rotation. After centering according to 
Eq. IA1I the overall rotation of the considered system is 
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subtracted. This can be done by rotating each configura- 
tion to a reference configuration by a least squares fit [56| • 
This translation-rotation correction will thus provide six 
eigenvalues equal to zero, which do not refer to the purely 
internal motion. 

Since the eigenvectors of the covariance matrix are mu- 
tually orthogonal, i.e. uncorrelated, the motions in the 
direction of each component, i.e. along the corresponding 
eigenvector, can be interpreted as a "mode" . To visualize 
the motion of each mode one has to calculate the coor- 
dinates including the mean of x and the fluctuations of 
the positions described by the principal component under 
consideration 

xf } =rh\ l) uii+ < x > (A13) 

where the index i denotes the i-th principal component 
and I the l-th timestep of the trajectory of total length 
L. Note that this yields a "caricature" of an important 
dynamical motif hidden in the full dynamics |34j | . 

For the PCA carried out here, all heavy atoms of the 
peptide backbone were included. In addition, a united- 
atom representation was introduced for each side chain 
by assigning a mass equivalent to that of all its consti- 
tuting atoms which was located at its center of mass. 
This procedure effectively reduces the number of atoms 
to N = 38 and thus simplifies the analysis and its rep- 
resentation. In line with previous findings |57j it was 
confirmed that this approximation does not lead to any 
significant alteration of the results. Tests indicate that 
PCA including all atoms provides an essentially identi- 
cal picture for the low-order modes that are of interest. 
The dependence of the resulting motions upon trajectory 
length was carefully evaluated and results were found to 
be well converged for simulations on the order of 20 ns 
in duration. For example, the dot product between the 
slowest motion, i.e. {nil} computed over two distinct 
20 ns trajectories and that from a 30-40 ns trajectory at 
the lowest temperature, 280 K, is about 0.98 therefore 
indicating that the modes are essentially converged. Fi- 
nally, the modes discussed in this investigation are found 
to exist also when using the zwitterionic octapeptide and 
in the simulations with a larger solvation sphere around 
the peptide as discussed in Appendix I A 21 



and k w , is sensitive to the actual choice of the parame- 
ters of the definition and was found to vary as much as a 
factor of 2-3 between different definitions. However, we 
always obtained the same qualitative trends that both 
HB numbers and rate constants (k w and k w ) feature a 
linear behavior with respect to temperature. Similarly, 
for peptide/peptide and peptide/ water HBs the same cri- 
terion based on distances (i?AH < 2.6 A where the proton 
acceptor A is either an O atom of a water molecule or of 
a peptide carbonyl group) and angles (Zd-h - a > 130° 
with the proton donor D being either water O or pep- 
tide backbone N) was used. As a check we varied the 
distance to -Rah < 2.4 A keeping the angle cutoff unal- 
tered and performed the analysis with only the distance 
criterion. Again we found the same general behavior of 
two temperature regimes based on the dynamics of pep- 
tide/ water HBs and the temperature dependence of n pp 
and ribw to be preserved. In conclusion, although the 
HB definitions do change the reported absolute values of 
the HB numbers and rate constants as expected [j^llif . 
the qualitative behaviour remains the same and thus the 
trends in the temperature dependence for these quanti- 
ties are not affected. 



4. Hydrogen Bond Analysis 

Concerning the definition of HBs in water, and thus 
the population variable h(t), we employ a standard struc- 
tural criterion based on both the distance (i?oH < 2.6 A 
as obtained from including the entire first nearest neigh- 
bor peak in the intermolecular OH radial distribution 
functi on) and the angle, Zo-h-o > 130° for h(t) = 
1 EH E3, 13 [4^|. Asa check we also performed 

the analysis with only the distance criterion and yet again 
with the distance criterion but using a tighter angular 
criterion of Zo-h - o > 150°. Consistent with previous 
findings [43, 13 the absolute value of rate constants, k w 
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